Text Line detection and Segmentation in Handwritten Gurumukhi Scripts

نویسندگان

  • Namisha Modi
  • Khushneet Jindal
چکیده

Gurumukhi script is a two dimensional composition of symbols with connected and disconnected diacritics. Handwritten Gurumukhi script has some complexities like connected, overlapped text lines. It is one of the major reasons for errors during the recognition process. Text line segmentation is a challenging job in unconstrained writer independent handwritten document image processing. There is a great need for research in the area of Punjabi handwriting recognition to resolve challenging problems involved in it. In this paper we have proposed an algorithm for text line segmentation in handwritten Punjabi document that deals with the problems like overlapped and connected components in text line and extract text lines from handwritten document image. The text line detection algorithm is based on locating the most favourable segments of text line and associating it with its respective text line inserting a gap between neighbouring text lines. Keywords— Text line segmentation, overlapped text lines, connected text lines, average height, Gurumukhi script

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Review: A Literature Survey on Text Segmentation in Handwritten Punjabi Documents

Gurumukhi script is used for Punjabi language, which is a two dimensional composition of symbols with connected and disconnected diacritics. Handwritten Gurumukhi script has some complexities like connected, overlapped text lines, words and characters. It is one of the foremost issues for errors during the recognition process. Text segmentation is a challenging job in unconstrained writer indep...

متن کامل

A New Algorithm for Detecting Text Line in Handwritten Documents

Curvilinear text line detection and segmentation in handwritten documents is a significant challenge for handwriting recognition. Given no prior knowledge of script, we model text line detection as an image segmentation problem by enhancing text line structure using a Gaussian window, and adopting the level set method to evolve text line boundaries. Experiments show that the proposed method ach...

متن کامل

An Approach to GUI Identification for Printed Gurumukhi and English Text

Optical Character Recognition system is used to recognize printed and handwritten alphanumeric text from input image. A numerous of methods have been published based on optical character recognition. In proposed work expansion of optical character recognition to recognize multi-scripts is done which in infancy. Such type of expansion is crucial in India where each state has diverse language. Th...

متن کامل

Printed Text Recognition System for Multi-Script Image

Optical Character Recognition system provides transformation of input text into editable form. Multi-script recognition systems are requisite in the countries like India where different people speak different languages in numerous states of country. In the recent time, multi-script recognition is a demanding problem and research work for expansion of optical character recognition scheme for cla...

متن کامل

Problems and Review of Line Segmentation of Handwritten Text Document

Optical character recognition (OCR) is a very popular research area since 1950's. Many people has done a lot of work on various scripts. Line segmentation is a very important step in OCR as the accuracy of the recognition algorithm highly depends on the correct line segmentation. Incorrect line segmentation not only decreases the accuracy but also may lead to some other errors. The objective of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013